The Prague Bulletin of Mathematical Linguistics NUMBER ? ? ? JULY 2011 1 – 8 Metric combination for the Machine Translation optimisation tool MERT
نویسندگان
چکیده
The main metric used for SMT systems evaluation an optimisation is BLEU score but this metric is questioned about its relevance to human evaluation. Some other metrics already exist but none of them are in perfect harmony with human evaluation. On the other hand, most evaluations use multiple metrics (BLEU, TER, METEOR, etc.). Systems can optimise toward other metrics than BLEU. But optimisation with other metrics tends to decrease BLEU score. As Machine Translation evaluations still use BLEU as main metric, it is important to minimise the decrease of BLEU. We propose to optimise toward a metric combination like BLEUTER. This proposition includes two new open source scorers for MERT, the SMT optimisation tool. The first one is a TER scorer that allows us to optimise toward TER; the second one is a combination scorer. The latter one enables the combination of two or more metrics for the optimisation process. This paper also presents some experiments on the MERT optimisation in the Statistical Machine Translation system Moses with the TER and the BLEU metrics and some metric combinations. c © 2011 PBML. All rights reserved. Corresponding author: [email protected] Cite as: Christophe Servan, Holger Schwenk. Metric combination for the Machine Translation optimisation tool MERT. The Prague Bulletin of Mathematical Linguistics No. ???, 2011, pp. 1–8.
منابع مشابه
Optimising Multiple Metrics with MERT
Optimisation in statistical machine translation is usually made toward the BLEU score, but this metric is questioned about its relevance to an human evaluation. Many other metrics exist but none of them are in perfect harmony with human evaluation. On the other hand, most evaluation campaigns use multiple metrics (BLEU, TER, METEOR, etc.). Statistical machine translation systems can be optimise...
متن کاملMulti-Task Minimum Error Rate Training for SMT
We present experiments on multi-task learning for discriminative training in statistical machine translation (SMT), extending standardminimum-error-rate training (MERT) by techniques that take advantage of the similarity of related tasks. We apply our techniques to German-toEnglish translation of patents from 8 tasks according to the International Patent Classification (IPC) system. Our experim...
متن کاملZ-MERT: A Fully Configurable Open Source Tool for Minimum Error Rate Training of Machine Translation Systems
We introduce Z-MERT, a soware tool for minimum error rate training of machine translation systems (Och, 2003). In addition to being an open source tool that is extremely easy to compile and run, Z-MERT is also agnostic regarding the evaluation metric, fully configurable, and requires no modification to work with any decoder. We describe Z-MERT and review its features, and report the results of...
متن کاملImproved Minimum Error Rate Training in Moses
We describe an open-source implementation of minimum error rate training (MERT) for statistical machine translation (SMT). This was implemented within the Moses toolkit, although it is essentially standsalone, with the aim of replacing the existing implementation with a cleaner, more flexible design, in order to facilitate further research in weight optimisation. A description of the design is ...
متن کاملMargin Infused Relaxed Algorithm for Moses
We describe an open-source implementation of the Margin Infused Relaxed Algorithm (MIRA) for statistical machine translation (SMT). The implementation is part of the Moses toolkit and can be used as an alternative to standard minimum error rate training (MERT). A description of the implementation and its usage on core feature sets as well as large, sparse feature sets is given and we report exp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011